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Abstract. A nonrepetitive coloring of a path is a coloring of its vertices such 
that the sequence of colors along the path does not contain two identical, con- 
secutive blocks. The remarkable construction of Thue asserts that 3 colors are 
enough to color nonrepetitively paths of any length. A nonrepetitive coloring 
of a graph is a coloring of its vertices such that all simple paths are nonrepet- 
itively colored. Assume that each vertex v of a graph G has assigned a set 
(list) of colors L v . A coloring is chosen from {L V } V £V(G) if * ne color of each 
v belongs to L v . The Thue choice number of G, denoted by 7r;(G), is the min- 
imum k such that for any list assignment {L v } of G with each \L V \ ^ k there 
is a nonrepetitive coloring of G chosen from {L v }. Alon et al. (2002) proved 
that 7r;(G) = 0(A 2 ) for every graph G with maximum degree at most A. We 
propose an almost linear bound in A for trees, namely for any e > there is 
a constant c such that tt;(T) ^ cA 1 ^ 6 for every tree T with maximum degree 
A. The only lower bound for trees is given by a recent result of Fiorcnzi ct 
al. (2011) that for any A there is a tree T such that tt;(T) = Q ( lo ' g °f g a ) ■ We 
also show that if one allows repetitions in a coloring but still forbid 3 identical 
consecutive blocks of colors on any simple path, then a constant size of the 
lists allows to color any tree. 



1. INTRODUCTION 

A repetition of length h (h ^ 1) in a sequence is a subsequence of consecutive 
terms of the form: x\ . . -XhXi ...Xh- A sequence is nonrepetitive if it does not 
contain a repetition of any length. 

In 1906 Thue proved that there exist arbitrarily long nonrepetitive sequences over 
only 3 different symbols (see [HUH])- The method discovered by Thue is constructive 
and uses substitutions over a given set of symbols. Recently a completely different 
approach to creating long nonrepetitive sequences emerged (see [7]). Consider the 
following naive procedure: generate consecutive terms of a sequence by choosing 
symbols at random and every time a repetition occurs, erase the repeated block 
and continue. For instance, if the generated sequence is abebe, we must cancel the 
last two symbols, which brings us back to abc. By a simple counting one can prove 
that with positive probability the length of a constructed sequence exceeds any 
finite bound, provided the number of symbols is at least 4. This is slightly weaker 
than Thue's result, but the argument seems to be more flexible for adaptations 
to other settings. This approach leads e.g. to a very short proof (see [7J) that for 
every n ^ 1 and every sequence of sets L\, . . . , L„, each of size at least 4, there is 
a nonrepetitive sequence Si,...,s n where Si G Li (first proved with an enhanced 
Local Lemma in [5]). The analogous statement for lists of size 3 remains an exciting 
open problem. In this paper we make use of the above-mentioned approach to 
nonrepetitive colorings of trees. 

For a given graph G we denote by V(G) the set of vertices of G. A coloring 
function / : V(G) — > N is a nonrepetitive coloring of G if there is no repetition on 
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the color sequence of any simple path in G. The minimum number of colors used 
in a nonrepetitve coloring of G is called the Thue number of G and denoted by 
7r(G). The dependence between the Thue number and maximum degree of graphs 
is already quite well understood. 

Theorem 1 (Alon et al. pQ). For any graph G with maximum degree A there 
is a nonrepetitive coloring of G using at most 16A 2 colors. Moreover, for every 



any nonrepetitive coloring. 

The Thue number of any tree is at most 4 (see PQ). Kiindgen and Pelsmajer [5] 
proved that ir(G) ^ 12 for all outerplanar G, and tt(G) ^ 4 fc for all graphs G with 
tree-width at most k. Probably the most intriguing question in the area concerns 
planar graphs. 

Conjecture 2 (Grytczuk 2007 [5]). There is a constant such that n(G) ^ c, for 
all planar graphs G. 

Very recently Dujmovic et al. [5] showed n(G) = O(logn) for all planar G on n 
vertices. 

Now, we turn to the list-version of nonrepetititve colorings of graphs. This is 
an analog of the classical graph choosability introduced by Vizing [11] and inde- 
pendently by Erdos, Rubin and Taylor [3]. Given a graph G suppose that each 
v G V(G) has a preassigned set of colors L v . We call {L v } v£ y(Q) a list assignment 
of G, or just lists of G. A coloring / is chosen from {L v } if f(v) G L v for all 
v G V(G). The Thue choice number of G, denoted by tti(G), is the minimum k 
such that for any list assignment {L v } of G with each \L V \ ^ k there is a nonrepet- 
itive coloring of G chosen from {L v }. The upper bound from Theorem [T] works also 
in the list-setting, i.e., 7r;(G) ^ 16A 2 for all G with maximum degree A. As we 
mentioned 7r;(P„) ^ 4 for all paths P n and the problem whether 3 or 4 is the right 
bound remains open. The first significant difference between the Thue number and 
the Thue choice number has been proved recently for trees. 

Theorem 3 (Fiorenzi et al. [S]). For any constant c there is a tree T such that 



In fact one can extract from [5] that for any A > 1 there is a tree T with 717 (T) = 
^( lo'giog a )- We- propose two results complementary to Theorem [3] First is an 
improved upper bound for the Thue choice number of trees. 

Theorem 4. For every e > there is a constant c such that 7r;(T) ^ cA 1+£ for all 
trees T with maximum degree A. 

A sequence is of the form x r for real r ^ 1 if it can be divided into [~r] blocks 
where all the blocks but the last are the same, say x\ . . . x n for some n ^ 1, and the 
last block is the prefix of x\ . . . x n of size [frac(r) ■ n] , where frac(r) is the fractional 
part of r. The sequence x\ . . . x n repeated in those blocks is also called the base of 
the given sequence. For example any repetition is a sequence of the form x 2 and 
abedabedab is of the form x 2 - 5 with the base abed. A coloring of a graph G is a; r -free 
for real r > 1 if there is no sequence of the form x r among the color sequences of 
simple paths in G. Thus, an x 2 -free coloring is simply a nonrepetitive coloring while 
an x 3 -free coloring satisfies a weaker condition, in particular it allows a coloring to 
have a repetitions. A consequence of our second result is that for any tree T and 
lists {L v } each of size 8 there is an a; 3 -free coloring of T chosen from {L v }. This 
explains somehow the tightness of Theorem [3] 

Theorem 5. For every e > there is a constant c such that for every tree T and 
lists {L v } ve y(T) each of size c there is x 2+e -free coloring of T chosen from {L v }. 



A > 1 there is a graph with maximum 




n{T) > c. 



NONREPETITIVE CHOICE NUMBER OF TREES 



3 



2. Proofs 

In both proofs given a tree T we are going to fix an arbitrary vertex for a root 
and denote it by root(T). For u, v e V(T) we say that u is a descendant of v if the 
unique simple path from u to root(T) contains v. The set of all descendants of v, 
including v, is denoted by v\,. The depth(u) is the number of vertices on a simple 
path from v to root(T). A vertex u is a child of v if u is a descendant of v and they 
are adjacent in T. We also pick an arbitrary planar embedding of T. This means 
we fix an ordering of children of every vertex in T. If v has a child, the first child of 
v in a determined order is first-child(v). If u is a child of v, but not the last child, 
then next-child(u, it) is the child of v that is next to u. 

A vertical path in a rooted tree is a simple path whose first vertex is a descendant 
of the last or vice versa. A coloring of a rooted tree T is vertically x r -free for real 
r > 1 if there is no sequence of the form x r among the color sequences of vertical 
paths in T. 

For any planar embedding of a given rooted tree T and list assignment {iu} B gv(r)i 
a pair (/, u) is a partial coloring if u 6 V(T) and / is a partial function from V(T) 
to N defined only for the vertices of T up to u in the preorder traversal of T and 
f(v) e L(v), whenever f(v) defined. The set of all partial colorings of a tree T with 
fixed {L v } veV ( T ) is denoted by PCOL. 

Following usual convention wc define [n] to be {1, . . . , n}. For a set of integers 
A we use A + to denote the set of finite sequences over A of length at least 1. For 
s 6 A + and n £ N we write s ■ n to denote the sequence s with appended element 
n. For a sequence s = (si, . . . , s n ) we put s\..i = (s\, . . . , Si). 

Consider a coloring of a rooted tree with an a; 2+e -block on some simple path. 
Clearly, at least half of the vertices of this path forms a vertical path whose color 
sequence is of the form x 1+£ / 2 . Thus, Theorem [5] is an immediate consequence of 
the following lemma. 

Lemma 6. For every e > there is a constant c — 4 • \-~\ such that every rooted 
tree is vertically x 1+£ -free colorable from any lists of size c. 

Proof. For a given e > put c = 4 • \^] . Let T be a rooted tree and {L v } veV (T) be 
the list assignment with each |L(v)| = c. In order to get a contradiction, suppose 
that there is no vertically x 1+e -free coloring of T chosen from {L v }. Fix an arbitrary 
planar embedding of T. 

We propose a very naive procedure struggling to build a proper coloring of T 
from {L v }. The procedure maintains (f,v), a partial coloring of T from {L v }, 
with no color sequence of the form x 1+£ on any vertical paths other than paths 
going upwards from v. To start the procedure we just pick a color for root(T) 
from L(root(T)) and all other vertices are uncolored. Every consecutive step of 
the procedure tries to correct and/or extend the current partial coloring. This is 
encapsulated by the call of nextV((/, v), n) function (see Algorithm [T|), where (/, v) 
is the current partial coloring and n is the hint for the next decision to be made. 
The call of nextV checks first whether (/, v) is vertically x 1+£ -free. If not then 
the colors from vertices in the repeated e-part of x 1+e occurrence starting from v 
are erased (as well as colors of all descendants of erased vertices) and the color 
for the top-most vertex with erased color is set again to be the n-th color from 
its list. If (/, v) is vertically x 1+e -free, nextV((/, v), n) tries to extend the partial 
coloring (/, v) onto the consecutive subtrees of v. We will keep an invariant that 
any extension of an input partial coloring (/, v) onto all descendants of v contains 
a vertical x 1+e -block. We will extend (/, v) onto ul for u being consecutive childs 
of v and if u is the first child of v whose subtree cannot be colored in this way then 
nextV sets the color of u to be the n-th color from L(u). 
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Algorithm 1: nextV((/, 



if x 1+£ occurs in (/, v) starting from v on the way to root(T) then 
I = the length of the base of x 1+e sequence 
m = \l ■ e] 

(vi +m , . . . , vi) = the path starting from vi +m — v going upwards in T 
with f(vi) = f{vi +i ) for 1 < i < m 

U <r- Vi +1 

erase all values of f in u\, 
else 

u = first-child(u) 

while / has a vertically x 1+e -free extension onto u\. do 
extend / onto u\, in vertically a; 1+e -free manner 
u = ncxt-child(w, u) 
extend / with {u — > a}, where a is the n-th element of L(u) 
return (/, u) 



The partial function nextV : PCOL x [c] -> PCOL is defined by Algorithm [TJ 
Note that nextV((/, v),n) is well-dchncd for partial colorings (f,v) with 

(i) no color sequence of the form x 1+£ on a vertical path other than paths going 
upwards from v, and 

(ii) no a; 1+e -free extension of (/, v) onto vj.. 

Moreover, if (/', u) = nextV((/, v), n) then this new partial coloring also satisfies 



(i) and (ii) This allows us to iterate the calls of nextV. Note also that vertex u 
is determined only by (/, v), i.e. the first argument of nextV, while f'(u) is simply 
the n-th color in L(u). 

Now, we define recursively a function h : [c] + — > PCOL which captures the idea 
of our naive procedure trying to color T from {L v }. For s£ [c] + , l^n^c and a 
being the n-th color in L(root(T)) put 

h(n) = ({root(T) a}, root(T)), 
h(s ■ n) = nextV(/i(s), n). 

First of all note that h(s) is well-defined for all s G [c] + . Indeed, h(s) is explicitly 



constructed for all s of length 1 and it trivially satisfies (i) while (ii) holds as we 
supposed that there is no vertically x 1+£ -free coloring of T from {L v }. Now h(s -n) 
is well-defined as nextV is well-defined for partial colorings satisfying (I)]|(ii) and a 
new partial coloring also satisfies (I)]|(ii) 



It is convenient to see s 6 [c] + as a seed driving to a sequence of partial colorings 
of T: h(si), h{s\..2), h{s\..^), . . . , h(s). Now, we aim to get a concise description 
of this sequence. Let (fi,Vi) = h(si,,i) for 1 ^ i ^ \s\. We define chosen(s) = 
(fi( v i), ■ ■ ■ ; f\s\( v \s\))- bi other words, chosen(s) is a sequence of colors set by 
instruction [13] of Algorithm [T] in consecutive calls of nextV on a way to build h(s). 

Claim. The function chosen is injective. 

Proof of the Claim. Note that the length of chosen(s) is equal the length of s. To 
get a contradiction let s ^ s' be the shortest sequences for which chosen(s) = 
chosen(s'). Let n = \s\ = \s'\. By minimality of s,s' we have Sx. .(n-i) = s i (n-i)- 
The first n — 1 values of chosen(s) depend only on Sx..( n _x)j therefore they are the 
same for both sequences. Moreover, the last values of chosen(s) and chosen(s') 
are picked from the same list. By the construction of the procedure, the list is 
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determined by ft<(si..( n _i)) = /i(sj ( n _x)) or it is just I/(root(i)) in case when n = 1. 
Since s ^ s' they must differ on the last coordinate. It means that indices of the 
last colors in chosen(s) and chosen(s') on the list are different, and hence the colors 
are different. □ 

Let s G [c] + , (fi,Vi) = h(si..i) for all 1 < i ^ \s\. For 2 ^ i ^ \s\ we denote by 
U, m-i the evaluations of variables I, m in the (i — l)-th call to the procedure nextV 
(for some calls, they may be undefined). Then W(s) = (depth(wi), . . . , depth (i>i a i)) 
is a supporting walk of s. The walk contains two kind of steps: positive, when 
W(s)i = W(s)i-± + 1, and negative, when W(s)i < W(s)j_i. Positive steps occur 
when procedure nextV descends into a subtree, i.e. evaluates the case from line |2] to 
[P2l Negative steps correspond to the calls in which repeated part of an x 1+e -block 
in the partial assignment is erased (lines [2] to [7]). Let us suppose that the i-th step 
was negative. Note that just from W(s) we can decode the length of the erased 
block, i.e. the value of m^. This is exactly W{s)i-\ — W{s)i + 1. However, to decode 
the corresponding value U we need some additional information. All we know is that 
rrii = \h -e], which leaves [1/e] possible values for £j. Therefore we annotate every 
step of W(s) with a number from {0, . . . , [1/e] — 1}. The number is meaningful only 
for negative steps. Formally the annotation function A : [c] + — > {0, . . . , [1/e] — 1} + 
is defined as follows. For 1 ^ i ^ Isl, 



A(s)i = 



li — \rrii /e\ if i-th step is negative 
otherwise. 



Let s G [c]+ and (/, v) — h(s) then vcolors(s) is the sequence of colors on the path 
from root(T) to v in a partial coloring /. Thus, the last value in vcolors(s) is f{v). 

Finally we define a total encoding function Log : [c] + — > N + x X N + as 

Log(s) = (W(s),A(s),vcolors(s)). 

Claim. The function Log is injective. 

Proof. Let s G [c] + . First we show that Log(s) uniquely determines sequences 
vcolors(si..i) for all 1 ^ i ^ \s\. Recall that vcolors(s) is written explicitly in 
Log(s). 

Suppose vcolors(si. i) is already known and now we reconstruct vcolors(si..(i_i)). 
If the i-th step of W(s) is positive, i.e. W(s)i > W(s)i-% then the length of the path 
from the root to the current vertex increased by 1 in step i. Thus, vcolors(sx..(j_i)) 
is exactly the same as vcolors(si..j) but with the last color erased. If the i-th step of 
W(s) is negative then to, = W(s)i — W(s)j_i + 1 is the size of the repeated £-block 
and U — \m,i/e\ + A(s)i is the size of the base of a x 1+e sequence fixed in this step. 
Clearly, the last color in vcolors(si..i) is introduced in the i-th step and k colors 
before form a base of the x 1+£ sequence that was retracted. Let (a% , . . . , , j3) be 
the suffix of vcolors(si..i). Then veolors(si..(j_i)) is just vcolors(si..i) with the last 
color, namely /?, erased and sequence [a%, . . . , a m ) appended. 

Once we have sequences vcolors(si..,) for all 1 ^ i ^ \s\, we may simply read 
their last values to reconstruct chosen(s). Now, the previous Claim assures that 
chosen(s) uniquely determines s. □ 

Let us fix M G N. We are going to give a bound for the number of distinct Log(s) 
for s of length M based on the structure of Log(s). For s G [c] M , supporting walk 
W(s) is a sequence of M positive integers with W(s)i — W(s)j_i ^ 1. Now replace 
all negative steps W(s) i: W(s)i + ± with a sequence W(s)i, W(s)i + 1, W(s)i, W(s)i — 
1, W{s)i — 2, ... , W{s)i + \. It is easy to see that such an operation is reversible and 
it results in a sequence of positive integers of size at most 2M with all steps in 
{ — 1, 1}. The number of such sequences is well-known to be o(2 2M ). The number 
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of possible annotation sequences A(s) is bounded by [l/e] M . Finally, vcolors(s) 
is a sequence of colors which appear on some simple path starting from root(T) in 
a final partial coloring h(s). There are |V(T)| simple paths starting from root(T) 
and each of them has at most c' v ' T " possible color assignments. 

By the last Claim the number of distinct Log(s) for s £ [c] M is simply c M . 
On the other hand the upper bounds from obtained just now we get the following 
inequality 



C M ^ „ ( A M\ 





M 


e 


■( 



|V(T)|- C I V < T )I). 



For c = 4["~] this gives a contradiction for sufficiently large M . □ 

Proof of Theorem^ Clearly, it suffices to prove the theorem for small values of e. 
We fix any e £ (0, 1), and choose 8 so that it satisfies 1 + e = (note that 5 < |). 
We are going to prove a bit stronger statement. There is a constant c such that for 
every rooted tree T with maximum degree at most A and lists {L v } v£V ^ each of 
size at least cA 1+£ , there exists a coloring of T from {L v } with 

(1) no color sequences of the form x 2 on simple paths in T, and 

(2) no color sequences of the form x 1+s on vertical paths in T. 

Let c be sufficiently large integer (c 12 • ([4] + 1) will do). Let T be a tree and 
{L v } ve v(T) be a lists assignment with each |i(w)| = c ^ cA 1+£ . In order to get 



a contradiction, suppose that there is no coloring of T chosen from {L v } with (1) 
and [(2)] satisfied. Fix an arbitrary planar embedding of T 



Like in the proof of Lemma [6] we propose a procedure struggling to accomplish 



an impossible mission that is to produce a coloring of T from {L v } satisfying (1 



and |(2)| The procedure maintains (f,v) a partial coloring of T from {L v } with 



the only possible violations of (1) and |(2)| on paths starting at v. To start the 
procedure we just pick a color for root(T) from L(root(T)) and all other vertices are 
uncolored. Every consecutive step of the procedure tries to correct and/or extend 
the current partial coloring. This is encapsulated by the call of next((/, v), n) 
function (see Algorithm^, where (/, v) is the current partial coloring and n is the 
hint for the next decision to be made. The call of next checks first whether (/, v) 
is vertically a; 1+5 -free. If not then the colors from vertices in the repeated 5-part 
of x 1+s occurrence starting from v are erased (as well as colors of all descendants 
of erased vertices) and the color for the top-most vertex with color cleared is set 
again to be the n-th color from its list. If (/, v) is vertically x 1+<5 -free then next 
checks whether it is a; 2 -free (see lines IWTB1 of Algorithm^. If there is a path P 
with a color sequence of the form x 2 then it must start at v and next clears the 
colors along P up to the last vertex which is a predecessor of v or up to the vertex 
which finishes the repeated block of x 2 occurence. Again, the color of the top-most 
vertex with color cleared is set to be the n-th color from its list. Finally, if there is 



no violation of (1) and |(2)| then next((/, v), n) tries to extend the partial coloring 
(/, v) onto subtrees rooted at consecutive childs of v. We will keep an invariant 
that such an extension of an input partial coloring (/, v) can not be done, and if u 
is the first child of v whose subtree cannot be colored in this way then next sets 
the color of u to be the n-th color from L(u). 

The partial function next : PCOL x [c] — > PCOL is defined by Algorithm [5J 
Note that next((/, v), n) is well-defined for partial colorings (/, v) with 

(i) no color sequence of the form x 1+d on a vertical path other than paths going 
upwards from v, and 

(ii) no color sequence of the form x 2 on simple paths other than paths starting at 
v, and 
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Algorithm 2: next((/, v), n) 



if x 1+ occurs in (/, v) starting at v on the way to root(T) then 
I = the length of the base of x 1+s sequence 
m = \l ■ <5] 

(vi +m , . . . , vi) — the path starting at vi +rn = v going upwards in T 
with f(vi) = f{vi +i ) for 1 ^ i < m 

u <r- Vi +1 

erase all values of f in u\. 
else if x 2 occurs in (/, v) starting at v then 
(«2i, ■ • ■ , vi) = the path starting at v^i = v 

with f(vi) = f{vi+i) for 1 ^ i < I 
k = the least integer i such that v is a descendant of vi 
if k ^ I then 

| U <r- Vi + i 

else 

| U <r- V k +1 

erase all values of f in u\, 
else 

u = first-child(u) 

while / has an extension onto u\. satisfying (1) and |(2)| do 
extend / onto u\. and keep (1) and (2) satisfied 
u = next- child (v, u) 
extend / with {u —> a}, where a is the n-th element of L(u) 
return (/, u) 



(iii) no extension of (/, v) onto v\. preserving (1) and |(2)| 

Moreover, if next((f,v),n) exists then this new partial coloring also satisfies (i) 
(iii) This allows us to iterate the calls of next. 



Now, we define recursively function h 



PCOL which captures the idea 



of our naive procedure trying to color T from {L v }. For s £ [c] + , l^n^c and a 
being the n-th color in L(root(T)) put 

h(n) = ({root(T) ->• a}, root(T)), 

h(s ■ n) = next(/i(s), n). 

First of all note that h(s) is well-defined for all s £ [c] + . Indeed, h(s) is explicitly 
constructed for all s of length 1 and it trivially satisfies (i) and (ii) while (iii) | holds 



Now 



as we supposed that there is no coloring of T from {L v } satisfying (1) and (2) 
h(s ■ n) is well-defined as next is well-defined for partial colorings satisfying (i)|(iii) 
and a new partial coloring also satisfies (I)|(iii) ' 



Now, for given s G [c] + we aim to get a concise description of h(si), h{s\..2) 1 
h(si..3), . . . , h(s). Let (fi,Vi) — h(si..i) for 1 < i < \s\. We define chosen(s) = 
(/i(^i), . . . , f\ s \{v\ s \)). In other words (and exactly as in the proof of Lemma 15]), 
chosen(s) is a sequence of colors set by instruction [22] of Algorithmic] in consecutive 
calls of next on a way to build h(s). 

Claim. The function chosen is infective. 

Note that if (/', u) = next((/, v), n) is defined then vertex u is determined only by 
(/, v), i.e. the first argument of next, while f'(u) is simply the n-th color in L(u). 
That is why the proof of the claim above follows exactly the same lines as the proof 
of the corresponding claim in the proof of Lemma |5] 
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For a partial coloring (/, v) let vcolors((/, v)) be the color sequence in / on 
vertices from root(T) to v. In particular, the last color in vcolors(s) is simply f(v). 

Claim. The function vcolors is injective on partial colorings from the image of h. 

Proof. We are going to prove that for any two partial colorings (/, u), (/', v') from 
the image of h, if vcolors((/, v)) — vcolors((/', v')) then (/, v) = (f',v'). The proof 
goes by induction on the length of vcolors((/, v)). 

When the length of vcolors((/, v)) and so vcolors((/', v')) is 1 then v = v' = 
root(T). Thus, root(T) is the only vertex colored by / and /', and the statement 
is trivial. 

Suppose that vcolors((/, v))\ — vcolors((/', v'))\ = n and the claim holds for 
all shorter sequences. Since (/, v) and (f',v') are in the image of h there exist 
s,s' G [c] + such that h(s) = (f,v) and h(s') — (f',v r ). Let (fi,Vi) — /i(si...,) for 
1 < i ^ \s\ and = h(s[ A ) for 1 ^ i ^ \s'\. Let j be the least index such 

that depth(t)i) ^ n for j < i ^ \s\. Analogously, let j' be the least index such that 
depth(w^) ^ n for f < i ^ \s'\. Now, we need a basic property of Algorithm^ 
that is, if (<?' , u') = next(g, u) then the coloring of a path from root(T) to u' , with 
excluded u', is the same in g and g'. This implies that the color sequence from 
root(T) to Vj is the same in partial colorings h(si...i) for all j ^ i ^ |s|, which 
is just the prefix of vcolors((/, v)) of length n — 1. Analogously, a color sequence 
from root(T) to v'j, is the same in partial colorings h(s[ J for all j' ^ i ^ |s'|, 
which is just the prefix of vcolors((/', ?/)) = vcolors((/, v)) of length n — 1. In 
particular this means that vcolors (( fj, Vj)) — vcolors((/j, , v'ji))- By the induction 
hypothesis we get {fj,Vj) — {fj>,v'j,). Now we do know that partial colorings 
(fj+i, Vj+i) and v 'j'+i) are generated by the calls of next with the same 

first arguments. Note that Algorithm [5] is deterministic (in particular line I^D)) 
in a sense that for the same input it always generates the same output. Thus, 
we immediately get that Vj + \ = say w = Uj+i, and two partial colorings 

(/j'+ij (/j+ii differ at most with the color of w. By the definition of j 

and j', in all the consecutively built partial colorings (fi,Vi) for j < i < \s\, (//,i>i) 
for j' < i ^ \s'\ vertex w is on the path from root(T) to the current vertex, i.e. vi or 
v[, respectively. Moreover, all these partial colorings differ at most in the subtree of 
w. But the only vertex from u>! colored in the final colorings (i.e. (/, v) and (/', v')) 
is w itself. Finally, in both of these colorings a vertex w receives the same color 
which is at the end of vcolors((/, v)) = vcolors((/', v')). Thus, (/, v) — (/',«'). □ 

Again (as in the proof of Lemma [5]) we aim to get a concise description of all 
these partial colorings and then apply a double counting argument. For s £ [c] + , 
let (fi,Vi) = h(si,,,i) for all 1 ^ i ^ \s\. For 2 ^ i ^ \s\ we denote by k,ki the 
valuations of variables l,k in the (i — l)-th call to the procedure next (for some 
calls, they may be undefined). Define W(s) = (depth (i>i), . . . , depth(w| iS |)) to be a 
supporting walk of s. We distinguish three kind of steps (differences) in W{s) 

(a) positive, when W(s)i = W(s)i-i + 1, i.e. no obstruction occures in the i-th 
step and Algorithm [2] evaluates lines [T81I2T1 

(b) a; 1+5 -negative, when W(s)i ^ W{s)i-\ and color sequence of the form x 1+s is 
fixed in the i-th step; this corresponds to the evaluation of lines [2K3 

(c) x 2 -negative, when W(s)i ^ W(s)i-\ and color sequence of the form x 2 is fixed 
in i-th step; this corresponds to the evaluation of lines 

Additionally we put m t = W(s)i — W(s)i^i + 1. For x 1+l5 -negative steps, rtii 
corresponds to the value of variable m in the corresponding call to the procedure 
next. 
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This time we need three kinds of annotations enriching the information given in 
W(s) . The first is analogous to the one in the proof of Lemma[S]and helps to recover 
lengths of the base of the x 1+s sequence in x 1+l5 -negative steps. Suppose that the 
i-th step was x 1+l5 -negative. Note that just from W(s) we can decode the length of 
the repeated block, i.e. the value of variable m,-. However, to decode a corresponding 
value U we need some additional information. All we know is that mi = \U ■ S~\ , 
which leaves [1 / <5] possible values for k . Therefore we annotate every negative step 
with a number from {0, . . . , [1/(5] — 1} and use an extra value for all steps which 
are not x 1+<s -negative. The annotation function A : [c] + — > { — 1, 0, . . . , \l/5~\ — 1} + 
is defined as follows. For 1 < i < Isl, 



li — \rrii/5\ if i-th step is x 1+l5 -negative 
— 1 otherwise. 



The second annotation function will serve to recover basic information concerning 
the paths whose part was retracted in a; 2 -negative step. Suppose that the i-th step 
is x 2 -negative. We want to recover the values of k and ki set in lines and [TTJ which 
represents the half of length of the path forming a repetition and the position of the 
tip in this path. Note that mt = W{s)i-\ — W(s)i + 1 is equal to min(Zj, 2U — ki). 
Hence, we need information what is the difference between li and ki. For 1 ^ i ^ \s\ 
let 

j li — ki if i-th step is ir 2 -negative 
I whatever otherwise. 



B(a)i 



To get a more convenient description of function B, we make a list of important 
values of function B and encode it into a sequence over { — 1,0,1}. If the i-th 
step is a; 2 -negative then we convert B(s)i into a sequence of 0's of length mi — 
W(s)i — W(s)i-i + 1 and if B(s)i ^Owe put sgn(B(s)i) in \B(s)i \-th position. We 
need to argue here that |i?(s)i| ^ to,. Indeed, as the partial coloring in the i-th 
step has no x 1+s occurrence we get that \U — ki\ ^ 8U and — nii ^ Sli, which give 

\B(s)i\ = \k - h\ Sli ^ -mi TO,. 

l—o 

The last inequality holds as 8 < \. We define B* (s) to be the concatenation of the 
sequences produced for all x 2 -negative steps. 

The third annotation contains the further description of the paths involved in x 2 - 
negative steps. Suppose that the i-th step is x 2 -negative and let P = (v2i i7 ■ ■ ■ , v\) 
be the path whose color sequence forms a repetition. Already from W(s) and B*(s) 
we will recover the size of the path and the value of ki such that Vk { is the tip of 
P. Now, we want to describe the way in which P goes down in T from up to 
v\. Let rij for 1 < j ^ k — 1 be the position of on the list of children of vertex 
Vj. Then put C(i) = (m, . . . ^n^-i) and C*(s) be the concatenation of C(i)'s for 
i being the indices of a; 2 -negative steps. 

A total encoding function is defined as Log(s) = (W(s),A(s),B*(s), C*(s), h(s)) 
for s e [c]+. Length of a Log(s) = (W (s) , A(s) , B* (s) , C* (s) , h(s)) is defined to be 
the length of W(S), hence |Log(s)| = |s|. Here comes the key property of Log 
function. 

Claim. The function Log is infective. 

Proof. Take any L from the image of Log. Suppose that \L\ = n. Then, there exists 
s £ [cf such that Log(s) = L. We are going to show that there is only one such s. 
We prove that reconstructing the sequence chosen(s) from L. This will prove the 
claim as we already know that chosen(s) is injective. 
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Let s' be the prefix of s of size n — 1. In one step of reconstruction we decode 
from L the last chosen color a and the value of Log(s'). Then, by simple iteration 
of this process, we reconstruct the whole chosen(s). The value of a may be simply 
read from h(s), which is explicitly given in Log(s). In order to get Log(s') note 
that W(s') and A(s') are just the prefixes of W(s) and A(s) of length \s\ — 1. It 
remains to reconstruct h(s'), B*(s') and C*(s'). The way we proceed depends on 
the type of the last step in W(s), which can be recognized from W(s) itself and 
A(s). Indeed, if W(s) n = W(s) n -i + 1, then the last step is positive. Otherwise 
the value of A(s) indicates which type of negative step we deal with. 
Cases 1 and 2. The last step in W(s) is positive or a; 1+5 -negative. Then B*(s r ) = 
B*(s), C*(s') = C*(s). The partial coloring h(s') is reconstructed exactly as in the 
analogous cases in the proof of Lemma [6] 

Case 3. The last step in W(s) is x 2 -negative. Let P = (i>2/„ , . . . , Wi) be the 
path whose color sequence forms a repetition and let Vk n be the tip of P. The 
number of vertices in P with colors erased can be read from W(s) and it is m„ = 
W(s) n — W(s) n -i + 1. By the construction of the Algorithm l2l f lines IMT6|) we have 

2l n — m n = max(^ n , k n ). 

From the last m n values of sequence B*(s) we can extract the value of l n ~k n . If 
all these values are zeros then l n — k n = 0. Otherwise exactly one of these m n values 
is equal to 1 or —1 and the position of this non-zero value determines \l n — k n \ while 
the sign of l n — k n is the same as the sign of this non-zero entry. Once we know 
d = l n — k n we can deduce that 

l n = m n and k n — m n — d, if d = l n — k n ^ 0, 

l n = 77i„ — d and k n = m n — 2d, if d = l n — k n < 0. 

Let h(s) = [f,u), h(s') — (/',«'). As we supposed that call of nextV generating 
h(s) from h(s') retracts a repetition on path P, we get that v! = vii n and u — 
i'2/„-m„+i- The color of u = U2i„-m„+i in /' was erased by line [TBI and replaced 
in line [22] of Algorithm [2 The colors of i | 2/„-m„+i, • • • , Vsi n were erased from /' 
and are not visible in / but the colors of V\,. .. ,V2i n - m „ remain unchanged. The 
vertex V2i n — m „ is clearly the parent of u. As we already reconstructed the value of 
kji, i.e. the position of the tip of P, we know the vertices of P lying on a path from 
V2i n -m n to root(T). In particular, we reconstructed the vertex Vk n in T. Now, 
we make use of C*(s). The last k n — 1 values of C*(s) indicates how the path 
P goes down in T from Vk n up tp v\. This way we reconstructed the position of 
(i'2/„-m„, ■ ■ ■ ,Vi) in T and we know that their colors are the same in / and /'. 
Once we know the colors of at least first half of the vertices of P (as m n ^ l n ) and 
as the color sequence of vertices from P forms a repetition in /' we may deduce the 
colors of u 2 ;„- TOn +i, ■ • • , in- 
putting all together we finally reconstruct vcolors(/i(s')) which is the sequence 
of colors in /' from root(T) down to u' — V2i n - Indeed, the colors from root(T) 
down to V2i n -m n are the same in /' and /, while the colors from f2/„-m„+i to i>2/„ 
has just been reconstructed. Now, recall that the function vcolors is injective on 
the partial colorings from the image of h which means that we can reconstruct from 
vcolors(/i(s')) a partial coloring h(s') itself. □ 

We are going to bound the number of distinct Log(s) for s of length M. For 
every s € [c] M we have Log(s) = (W(s),A(s),B*(s),C*(s),h(s)). Just like before, 
the number of integer walks W(s) of length M is o(4 M ). The number of possible 
annotation sequences A(s) is bounded by (|~l/<f| + l) M . The annotation B*(s) is 
a sequence of numbers { — 1, 0, 1} of length J2i m h where i goes over all the indices 
of .x 1+l5 -negative steps. Clearly, X)i m » ^ M and so the number of distinct B*(s) 
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is bounded by 3 M . The annotation C*(s) is the concatenation of sequences over 
{1, . . . , A — 1}. The length of C*(s) is equal to J2 i fcj, where the sum goes over the 
set I x 2 of all the indices of x 2 -negative steps. Clearly, 

^ ^(l + ^<^ T ^m i < T f|M=(l + £ )M, 
»e/ x 2 ie/ x 2 »=i 

By the last Claim the number of distinct Log(s) for s 6 [c] M is simply c M 
(cA)( 1+£ ' M . On the other hand we just obtained an independent upper bound and 
altogether we get the following inequality 

/ r 1 1 \ M 

( c A)( 1 + £ ) M ^o(4 M )-[ i +1] -3 M -A( 1+£ ) M - (|y(T)| c l y ( T )l). 
For c > 12 • ([^] + 1) and sufficiently large M we get a contradiction. □ 
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